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This report Is an Executive Summary of the Final Report on the Study of 
On-Board Compression of Earth Resources Data. The study was performed by 
TRW for NASA-Ames Research Center under contract NAS2-8394. Only high- 
lights of the study are presented here. The reader is referred to the 
Final Report for more detailed treatments. 

1 . INTRODUCTION 

NASA's LANDSAT (ERTS) program has provided a wide variety of users the opportunity 
to explore the utility of satellite-based earth resources observation. Although the 
full range of applications has not yet been addressed, the potential of such systems to 
earth resource management has been definitely established. Even at this early stage, 
many commercial and governmental agencies have expressed strong interest in the 
operational use of earth observations. The LANDSAT experiment ha$ also shown that 
converting this potential into an operational system in which practical earth resource 
management can be accomplished requires extensive investments in communications, data 
handling, and data processing. These investments are continually increasing along with 
user requirements for increased spatial and spectral resolution, earth coverage, and 
data timeliness which imply sensor output and data processing rates of hundreds of 
megabits per second. 

The objective of on-board data compression (source encoding) is to reduce costs 
and/or systems constraints incurred by high data rates in the sensor-to-ultimate user 
data handling chain. When the data rate and volume are reduced at the sensor, obvious 
benefits result. These include reduced on-board storage, simpler data transmission, 
simpler ground data recording, and fewer data tapes to archive. This is accomplished 
by exploiting statistical dependencies which exist between data samples in order to 
reduce the information rate. 

Data compression has been the subject of much research and a large number of 
papers over many years. In spite of the progress made in this field, few spacecraft 
missions have been able to justify on-board data compression with its attendant risks 
in reliability and data alteration, and its cost in terms of size, weight, and power. 
The situation, however, has changed recently. Digital logic has become smaller, more 
reliable, less expensive, and much less power consuming for a given amount of process- 
ing. It now appears practical to actually implement an operational data compression 
system. 

One of the principal considerations in selecting the most feasible data compres- 
sion technique is the need for broadbased applicability. Earth observation experiments 
are charactized by general purpose sensors which are used by hundreds of investigators 
for many different purposes. Such a variety of users requires that the designer pro- 
vide an information management system as broadly useful as possible within the cost 
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constraints. Achievement of this maximum utility In terms of cost benefits and cost 
avoidance Is a major tradeoff Item. 

1.1 NEED FOR DATA COMPRESSION OF EARTH RESOURCES DATA 

By surveying the almost continuous studies of user needs for earth resources data, 
it Is possible to judge trends In data requirements. One recent survey of these 
studies related data types, such as ocean survey, meteorology, agriculture, forestry, 
geology, and mineral resources to user community uses ranging from sea surface effects 
(temperature, roughness, etc.) to terrain mapping, atmospheric pollution, and severe 
storm warning. Additional factors such as data uses, data destination, data perisha- 
bility, frequency of observation, and resolution can then be used to form candidate 
missions and sensor groupings to meet emerging needs. Figure 1 presents a summary of 
typical sensor/mission groupings. 
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Figure 1. Mission/Sensor Grouping 


The sensors listed are typical of the major classes of earth resources sensors to 

be used in the post 1975 time frame. Among these, the highest data rates are achieved 

by the imaging sensors, which are often mul ti spectral , and the synthetic aperture radar 

One convenient way to look at the data rates of imagers is from the standpoint of the 

user oriented requirements shown in Figure 2. When projected into the 1980's these 

sensors lead to spacecraft missions with instantaneous composite data rates in excess 

12 

of 700 Mbps and data volumes approaching 10 bits per orbit. These high rate require- 
ments are solidly based on known earth resources user community needs. Economic and/or 
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political vicissitudes notwithstanding, the necessity of considering data compression 
tradeoffs In future earth resources mission studies 1$ obvious. This report seeks to 
provide concrete results on multlspectral data compression upon which to base such 
tradeoffs . 



10 100 
GROUND RESOLUTION, R, (METERS) 


1000 


Figure 2. Data Rate vs Ground Resolution 
for Imagery Sensors 


1.2 EXPECTED BENEFITS OF DATA COMPRESSION IN NASA EARTH 

RESOURCES PROGRAMS 

The ultimate aim of data compression is to Increase the cost-effectiveness of data 
management in operating earth resources systems. In most cases, the Incremental cost 
of adding data compression is modest. Benefits, however, are usually realized by 
designing the data management system with integral data compression, not as an add on. 
There is little Impact, for example, in reducing the data rate on an existing com- 
munications link if the released capacity cannot be used elsewhere to reduce cost. 

Cost benefits, in general, are quantitatively related to the compression ratio which is 
achievable within fidelity constraints. Although the exact relationships and demon- 
strabillty of cost savings may be complex, reductions substantially related to the 
compression ratio are expected. Some of the more obvious cases In which data compres- 
sion will yield inmediate benefits to NASA operating data management systems are: 
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• High data rate recording at low cost ground stations 

t Relaying of data at moderate to high rates, via common carriers (e.g., 
domestic satellite) for quick look or fast reaction needs 

• Maintenance of digital data bases on Image analysis computer systems 
(disks, tapes, drums, etc.) 

• Handling, transportat.on, and capital costs associated with digital 
(computer) tape distribution and recovery. 

It must be recognized that the compression scheme used should be carefully 
matched tn the application. General user applications, such as data archiving, will 
usually attain only modest compression (factors of 2 to 5). Specific users, however, 
who require only data subsets with specific characteristics, or who are satisfied with 
some loss of resolution either spectrally or spatially, can realize cost benefit 
ratios of 10 to 100. Achievement of some gains is immediately possible in the former 
case. Information preserving techniques developed under previous NASA studies can be 
implemented to reduce the cost of archiving and recalling data. In this case, special 
care is taken that the cost associated with reconstructing compressed data files is 
minimum because this operation wi i i be repeated often. Compression, on the other hand, 
is performed only once, upon archiving, and can thus be more complex. 

Two examples of data compression benefits to NASA programs are as follows: 

Earth Observatory Satellite 

One feature of this program is the ability to directly transmit imagery data to 
low cost user stations for their own processing. Cost considerations, however, dictate 
a reduced data rate from the 240 Mbps provided to primary stations. A 12 to 1 com- 
pression on-board the spacecraft would allow reception of full resolution data at a 
reduced rate. The only alternatives are various subsampling approaches such as sending 
1/4 resolution in each spatial direction. This decreased resolution would severely 
limit the utility of the imagery to users, even to the point of jeopardizing the prac- 
ticality of low cost stations. 

Goddard Space Flight Center to Sioux Falls Link 

i p 

The EOS system can produce imagery totaling 10' bits per day. After GSFC proc- 
essing, this data must be transmitted to the EOS data center at Sioux Falls, S.D. A 
continuous 10 Mbps link would be required for this transmission for which costs via 
domestic comsat can exceed $1,000,000 per year. A 4 to 1 compression ratio could pro- 
vide approximately $750,000 in annual savings. 
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2. PURPOSE OF THE STUDY 


The purpose of the study was to provide NASA with comprehensive tradeoff data and 
specific recommended methods of on-board data compression for use in planning and con- 
figuring both present and future earth resources programs. This study forms a natural 
second step to previous NASA funded studies of data compression techniques for multi- 
spectral imagery which were more limited in scope and considered only a few specific 
algorithms. General Electric and Philco-Ford restricted their investigations co 
rather elementary schemes which were verv simply implementable for on-board use. 

Purdue University considered more complex (transform) techniques and demonstrated their 
performance on both aircraft and satellite multi spectral scanner data. TRW Systems and 
Purdue Un versity performed parallel studies under the ERTS (now LANDSAT) program to 
compress multispectral scanner imagery using errorless codes for both archiving and 
on-board application. 

These studies exhibited a number of very basic differences. For one, they were 
forced to use different data bases due to the sequencing of NASA programs and thus a 
meaningful comparison of results was difficult. Furthermore, they did not, in general, 
include important systems parameters such as the effects of channel noise and sensor 
peculiarities on individual algorithm performance. And finally, the use of differing 
criteria of optimality and fidelity caused difficulty in applying their results to 
on-going NASA programs. 

The present study is much more comprehensive and unified. It applies a common 
set of data, criteria, and system parameters and considers a broad range of data com- 
pression techniques. The aim was to find the techniques most suitable for multispectral 
imagery and which inject minimum distortion as measured by a variety of fidelity cri- 
teria. Most unique among the criteria was classification accuracy which recognized 
that many end users employ only specific processing of their data products. Subjective 
evaluation of typical imagery products was also used. An additional objective was to 
obtain parametric results which could be used by NASA in future program planning. 

This included a requirement to deliver all computer simulations to both the Ames and 
Marshall NASA Centers. 



3. STUDY APPROACH 


The basic study approach taken Is summarized In Figure J. The study began with 
a survey of the current literature or imane bandwidth compression and selection of 
those methods relevant to compressl' jltispectral imagery. Typical satellite 
multispectral data was then analy 2 . . .istically and the results used to select a 
smaller set of candidate bandwidth compression techniques particularly relevant to 
earth resources data. These were compared using both theoretical analysis and 
simulation, under various criteria of optimality such as mean square error (MSE), 
signal-to-noise ratio, classification accuracy, and computational complexity. By 
concatenating some of the most promising techniques, three multispectral data com- 
pression systems were synthesized which appear well suited to current and future 
NASA earth resources applica cions. The performance of these three recomnended systems 
was then examined in detail by all of the above criteria. Finally, merits and 
deficiencies were summarized and a number of recommendations for future NASA activities 
in data compression proposed. 
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Figure 3. Flow Diagram of the Study 
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Several comnents are relevant to assumptions and limitations pertinent to this 
study: 

(1) Only two sets of multlspectral data were used In analysis and simulation of 
the various data compression algorithms. One four-band LANDSAT MSS Image was relied 
upon heavily In arriving at data statistics and determining relative performance. This 
scene (Imperial Valley, California), which is quite representative of earth resources 
data, has been Intensively studied by other investigators, and contains a rich mixture 
of high and low detail, brightness levels, and colors. A second, entirely different 
type, 12-channel multi spectral image was used to verify the generality of major 
results and conclusions. It was supolied by the Laboratory for Applications of 
Remo! Sensing (LARS). 

(*; The study considers largely nonadaptive, rather than adaptive, methods for 
compressing multlspectral image data. Nonadaptive image coding methods are better 
established and tested, are easier to implement, and in general are less data 
dependent. However, in the past year (1975) a number of adaptive algorithms for cod- 
ing images have been developed which deserve further consideration in specific appli- 
cation studies. 

(3) The study includes computational complexity of the various compression 
methods and presents tables for comparison in terms of number of arithmetic operations 
and storage requirements. However, hardware implementation tradeoffs specifying 
design complexity were considered beyond the scope of this study. 


4. MAJOR CONSIDERATIONS IN DATA COMPRESSION 
FOR EARTH RESOURCES DATA 

4.1 NATURE OF MULTI SPECTRAL IMAGE DATA 

As indicated above, this study concentrated on the compression of multispectral 
imagery because, of all earth resources data types, it has by far the greatest poten- j 
tial for cost savings. This results from the high data rates and volumes involved. 
Figure 4 illustrates typical multispectral image digital data as produced by a multi- 
spectral scanner (MSS). Each spectral band is a separate image of the same exact 
ground area made up of discrete sampled points called picture elements (pixels). 

Pixels corresponding to the same ground point are precisely aligned from one spectral 
image to another as indicated. This registration is inherent in most MSS's and is 
important for good data compression because it maximizes the correlation of data 
intensity between adjacent pixels. 
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Figure 4. Illustration of Typical Multlspectral Image Data 

A typical Image size In earth resources applications Is 4000 x 4000 pixels (which 
In LANDSAT was some 100 nml x 100 nmi). If each pixel Is quantized to 8-blt accuracy 
and there are four bands, one multlspectral Image requires 4000 x 4000 x 8 x 4 - 512 
million bits for full representation. Note that each ground element can be considered 
as represented by a four element vector which Is a unique point In four-dimensional 
multlspectral space. Many processing and compression schemes involve manipulation of 
these vectors. 

4.2 CRITERIA FOR COMPARING BANDWIDTH COMPRESSION TECHNIQUES 

For data compression to be possible, one or both of two conditions must te satis- 
fied. The first is that the data have some- structure or redundancy. By knowing some- 
thing about the data structure, data compression can be accomplished without losing any 
of the information the data contains. Data compression that relies solely on knowledge 
of data structure is called information preserving or error free data compression. The 
second condition is that the image user have interest in only limited accuracy or a 
particular aspect of the data collected. In this case, the data of low value to the 
user can be discarded, resulting In data compression. Since information discarded is 
permanently lost, compression based on user interest is called information destroying 
data compression. 

Information discarded by the compression process causes differences in the 
original and reconstructed images. The fundamental theoretical problem in data com- 
pression is to choose a technique which drives this distortion as low as possible at 
a given transmission rate. However other factors, such as susceptibility to channel 
errors and complexity of the compressor, are also important in practical systems. 
Several pertinent comparison criteria are discussed below. 
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The clustering method serves well for measurir how well classification accuracy 
is preserved after data compression. To be fair, we assume that the original image 
and the reconstructed image are clustered separately, but by the same process. Call 
the first space of undistorted (precompression) measurements X, and the distorted 
(compressed and reconstructea) measurements Y. For the classification accuracy to be 
preserved, the two partitions should be such that if y is a distorted value corre- 
sponding with x, then the category to which y belongs should be the same as the cate- 
gory from which x came. From Figure 5, it is seen that class 1 and class 2 can be 
distinguished equally well in the space X and the space Y with the appropriate parti- 
tions. Unde»" certain distortions D, however, it may not be possible to preserve class- 
ification accuracy. The percentage of picture elements classified differently in the 
original image and the reconstructed image is used as the classification inaccuracy 
for the particular bandwidth compress - ' on method. 


x Y 



Figure 5. Partitioning Data in Two Classes Before 
and After Distorting the Data 


Implementation Complexi ty 

Once it is shown that several compression techniques can signficantly reduce the 
required data rate without introducing unacceptable distortion, it is necessary to 
compare their implementation complexity. Compressor complexity is a particularly 
significant factor for spacecraft applications. Reconstruction complexity on the 
ground has less effect on overall system costs. From a strictly algorithmic ’eve , 
complexity may be measured by the number of computations per pixel. However, a break- 
down by arithmetic operations, such as add/subtract, multiply/divide, and memory 
operations, is required for closer comparison of compression techniques. Data storage 
requirements are often the significant complexity factor. It should be noted that 
both memory and computation rates are sensitive to the particular mul tispectral 
scanner assumed. 


PRECEDING PAOft BT.ANTC NOT FUJiRO 


- 10 - 



Sensitivity to Sensor Effects 


Certain properties of the detectors used In multispectral sensors affect the data 
structure and thus in turn interact with compression performance. The specific distor- 
tion introduced varies with the compression technique. As a typical Imaging sensor to 
be used in the next decade, consider the thematic mapper projected for NASA's Earth 
Observatory Satellite (EOS) program. It Is a multispectral scanner using photodiodes 
as transducers and a moving mirror to scan lines perpendicular to the satellite's 
f rack. The properties that cause distortion during data compression are nonumformity 
and nonlinearity of the photodiodes' gain response and spectral misregistration due to 
the optics and scanning patterns. In this study, these variations are modeled sta- 
tistically and then related to increased mean square error. 

4.3 BANDWIDTH COMPRESSION METHODS FOR MULTISPECTRAL IMAGERY 

Many techniques exist for compressing monochrome images and chey are richly 
described the literature. Since a multispectral image can be thought of as N mono- 
chro le inaues v for N spectra 1 bands), one approach is to compress each monochrome 
image sepa.ately by any one of the standard techniques. This tends, however, to 
ignore any redundancy existing in the spectral domain. Suppose that the brightnesses 
recorded in different bands were correlated. Then the jistribution of vectors would 
lie primarily in some hyperplane in the multispectral space, S. This, in turn, would 
imply that the data has structure which can be taken advantage of for data compression, 
since the hyperplane has a lower dimension than the original space. Transformation to a 
new basis vector set is often used for thi > purpose. In its optimum form, the trans- 
formation finds an ordered set of basis vectors (called principal components) for S 
which have the property that the first basis vector points in the direction of the 
greatest data variance, the second basis vector points in that direction orthogonal to 
the first basis vector having the next greatest variance, and so on. This results in 
a new set of monochrome images in which most of the information is in only one or two 
bands and the other bands an car very nearly uniformly grey. Data compression is 
accomplished by quantizing most finely the data in the important bands and either dis- 
carding or coarsely quantizing f he remainder. 

Transform Coding 

Typical mathematical transformations used for the above processes are the Fourier, 
Hedamard, and Karhunen-Loeve. Each has its advantages and disadvantages in terms of 
performance and complexity of implementation. The transform process can also be 
carried cut one or two dimensiorally in a monochrome image with compression taking 
place via quantization of the coefficients in the transform domain. The data com- 
pression literature shows this to be the preferred techn gue for compressed data rates 
less than ?. bits per pixel. 
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Differential Pulse Code Modulation (DPCH) 

There are many variants of OPCM, but ail utilize a predicted value X which is a 
function P{*) of past data, a suotractor, a quantizing function, and a coding function 
as shown in Figure 6. Reconstruction, in the absence of channel errors, is accomplished 

A 

by decoding the channel symbol stream and reproducing X by duplicating the predictor 
function. The only distortion in DPCM is introduced by the quantizer whicn utilizes 
only m levels to represent the error signal c, and thus establishes a bit rate of 
Ic^m bits per pixel.* In general, DPCM produces superior reconstructed images at 
rates greater than 2 bits per pixel and is preferred to transform coding in that range. 
An additional attraction is its simplicity of implementation favoring on-board 
applications. 
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Figure 6. Generic DPCM Encoder and Receiver 


Hybrid Trans form/ DPCM Techniques 

In addition to DPCM and transform coding techniques, hybrids of the two have 
shown good performance for both intra- and inter-frame coding of television signals. 
These systems as applied to MSS data can be divided into three categories: 

1) Systems that perform a two-dimensional transform in each band of the 
MSS data and use a bank of DPCM encoders to process the transformed 
coefficients across the individual bands 

2) Systems that take a transform across the spectral bands and use different 
two-dimensional DPCM encoders to compress the data in each transformed 
band 


The well known delta modulators are defined as DPCM with m = 2; i.e., 1 bit per pixel. 
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3) Systems that follow a transformation across the spectral bands by a second 
transform in the horizontal direction (scan direction) and a DPCM encoder 
In the vertical direction separately for each transformed band. 

Techniques 2) and 3) are shown in this report to hold good promise for earth resources 
data applications. 

Cluster Coding of Hultispectral Data 

For many classification applications, multi spectral data is first used to obtain 
a clustered image as described previously. This presents an alternate approach to 
compressing the bandwidth of multispectral imagery consisting of Classifying the 
multi spectral data on-board the space vehicle and transmitting only the clustered 
imagery. That is, at each pixel location, instead of transmitting the multidimensional 
vector, only the cluster number to which the pixel belongs is sent. Subjectively, the 
reconstructed multispectral images are very good but any single band image can be 
heavily degraded. Thus cluster coding is excellent for some users and not acceptable 
for others. Different approaches to this method of bandwidth compression have 
appeared in the rer ' ; literature. 


5. STUDY RESULTS 


5.1 LITERATURE SURVEY 

As a beginning point for the study, some 150 references relevant to d'' ’ spres- 
sion of remotely sensed earth resources imagery were compiled. Each reference includes 
keywords and a brief description of its contents. This bibliography is included as an 
appendix of the Final Report. The literature survey is broken into three areas con- 
sisting of: 

e Requirements, methods, and applications of remote sensing 

a Data compression techniques relevant to imagery 

a Techniques and applications of automatic classification to remotely 
sensed imagery. 

Twenty-two generic classes of compression techniques were identified as relevant to 
the study. They fell into the four general categories of transform coding, predictive 
coding, entrophy coding, and cluster codinq. 

5.2 MULTISPECTRAL SCANNER DATA CHARACTERISTICS 

The four bands of a typical LANDSA1 MSS image are shown separately in Figure 7. 
This 256 x 256 pixel image was tGceh O: the Imperial Valley in California and 

includes agriculture, mountains, dtser ■ id wew. It is a good test image for data 
compression because it has both very high (fit.d boundaries and river) and low (desert) 
spatial frequencies. Measurements reveal that the da* a in each band has an exponential 
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NORMALIZED CORRELATION 


spatial correlation in both the horizontal (across track) and vertical (along track) 
directions. This indicates that one and two-dimensional DPCM should perform well. 
Furthermore, the spatial correlation remains exponential after application of any of 
the popular unitary transforms (KL, Fourier, Hadamard, Haar, or Slant). Figure 8 
shows the spatial correlation in each band of the LANDSAT image after a Haar transform. 



2 4 6 8 10 12 14 16 

SEPARATION DISTANCE IN PIXELS 


Figure 8. Spatial Correlation of Transformed Bands after Haar 
Transform of Multispectral Image 

Spectral correlation, however, does not vary exponentially from band-to-band. 
Indeed the various bands show similarities and differences which are not usually 
found in other applications such as color television. Table 1 shows a typical spec- 
tral correlation measured for the LANDSAT image of Figure 7, and may be summarized as: 

e Large positive correlation between the red ( MS S - 5 ) and green (MSS-4) boards 

• Large positive correlation between the two infrared bands (MSS-6 
and MSS-7) 

• Small negative correlation between the red and the two infrared 
bands. 

While these correlations apply on the average, they do not apply for many particular 
object classes - water being one counterexample . 
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Table 1. Spectral Correlation Matrix of the 
Representative MSS-UNDSAT Data 



RED 

GREEN 

IRI1 

IR#2 

RED 

1.000 

.866 

.276 

-.141 

GREEN 

.866 

1.000 

.288 

-.177 

IR#1 

.7 76 

Pflfl 

1 .000 

.654 

IR#2 

-.141 

-.177 

.654 

1.000 


5.3 SELECTION OF CANDIDATE TECHNIQUES 

In order to arrive a: a smaller list of specific compression techniques, the 22 
generic classes from the literature survey were examined for their applicability to 
mul tispectral scanner generated data. Specific attention was paid to how each would 
perform on data with the spatial and spectral correlations just described. The fol- 
lowing techniques were found to be viable candidates: 

1) Three-dimensional transform coding with block quantization using 
either the Fourier, Hadamard, Slant, or Cosine transforms. 

2) Mixed three-dimensional transforms using the optimal Karhunen-Loeve 
(K-L) transform across the spectral bands and either the Fourier, 

Hadamard, or Cosine transform two dimensionally in each transformed 
band. 

3) Two-dimensional spatial transform (either Fourier, Cosine, or 
Hadamard) followed by DPCM from band-to-band, separately for each 
transform coefficient. 

4) Optimal K-L transform across the spectral bands and two-dimensional 
DPCM in each transformed band. 

5) Optimal K-L transform across bands and hybrid coding (Cosine/DPCM) 
in each transformed band. 

6) Cluster coding algorithms. 

5.4 EXPERIMENTAL RESULTS 

The above candidate methods were then subjected to both theoretical and experi- 
mental analysis in the data rate ranges of 1/2 to 2 bits per pixel. A very complete 
comparison of computational complexity was performed and sensitivity to sensor effects 
estimated. The criteria of optimality discussed in Section 4 were used. Based on 
these evaluations and simulations, a set of three recommended techniques was derived: 

1) K-L transform across the spectral bands followed by two-dimensional DPCM in 
each transformed band (denoted KL-2D DPCM) 
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2) K-L transform across the spectral bands followed by the hybrid 
Cosine transform (across track or horizontally) and DFCM on the 
transform coefficients (along track or vertically) 

3) The cluster coding technique. 

Although future sensors may exhibit different data statistics, this selectic.. win, 
with high probability, not change. Only specific parameters of each technique need 
be optimized. 

The KL transform requires extensive computation because It is data dependent. 

For the spectral transform it may, however, be replaced with the deterministic Haar 
transform with little loss in fidelity. Similarly, in 2), the much simpler Hadamard 
or Slant transforms can replace the Cosine spatial transform. It was found that elim- 
inating the spectral transformation altogether in 1) and 2) increases the MSE by a 
factor of about 1.4. 

5.4 COMPARISON OF THE RECOMMENDED TECHNIQUES 

In the following the three recommended techniques are compared under various cri- 
teria of optimality such as MSE, recognition accuracy, and computational complexity. 

The performance of methods 1) and 2) is also evaluated in the presence of channel 
noise. 

5.4.1 Comparison of MSE and Signal -to-Noise Ratio Performance 

Figure 9 shows the performance (i.e., bit rate vs distortion) of the three tech- 
niques. Signal-to-noise r.:.tio is used as a criterion of distortion. It is directly 
related to MSE as given. It is seen that the KL-2D DPCM encoder performs significantly 
worse than the KL-Cosire-OPCM at low bit rates but their performance is similar at 
higher bit rates (about 2 bits per sample per band). The performance of cluster cod- 
ing is just the opposite. It is superior to the others at low bit rates and inferior 
at high rates. Figure 9 also shows the performance of the KL-Cosine-DPCM and KL-2D 
DPCM encoders in concatenation with a variable rate (entropy) encoder. The optimal 
zero-order Huffman code is used which reduces the output bit rate by assigning short 
code words to the more probable quantizer output levels and longer words to the less 
probable. The effect is more dramatic (average 25% improvement) on the KL-2D DPCM 
than on the KL-Cosine-DPCM {10% improvement). Being an information preserving code, 
this technique lowers the rate while adding no distortion at a modest cost in 
complexity. 

5.4.2 Comparison Based on Recognition Accuracy 

Fiqure 10 shows the performance of the three recommended bandwidth compression 
methods in preserving the classification accuracy of the reconstructed multispectral 
imagery. Using the criterion defined in Section 4.2, the comparative performance of 
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Figure 9, Bit Rate vs Signal -to-Noise Ratio 
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CLASSIFICATION CONSISTENCY (PERCENT OF PIXELS WITH UNCHANGED CLUSTER ASSIGNMENTS) 


Figure 10. Bit Rate vs Classification Consistency 

the KL-2D DPCM and KL-Cosine-DPCM is basically the same as when using signal-to-noise 
ratio. This suggests a strong correlation between MSB and classification accuracy, 
as performance criteria for selecting compression techniques. 

Surprisingly, the classification accuracy of cluster coding decreases with 
increasing rate over a large range of rates. This behavior is due to the cluster coding 
procedure using a fixed size block (16 x 16 pixels) and varying the number of clusters 
per block while classification accuracy is measured with a 256 by 256 pixel block. 

100% classification accuracy would result if the block sizes were the same. The use 
of large block sizes in mul tisnectral image classification depends very much on the 
user and application. Large block sizes are likely to be used where one is only 
interested in a limited number of objects in a large segment of imagery, as in 
hydrology and urban planning. The results shown here for cluster coding are thus 
largely a result of the technique chosen for comparison and may be better or worse 
in a specific application. 
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5.4.3 Comparison of System Complexity 


The computational complexity of the three recommended techniques Is summarized 
parametrically In Table 2. The results are in terms of the number of computations 
and required memory per pixel. This table also shows typical numbers for four-channel 
(e.g., LANDSAT) data and the block sizes used In the simulations. An analysis of the 
complexity of the cluster coding method was not performed. However, based on the com- 
puter time required for cluster coding the LANDSAT data (Figure 7) and comparing It 
witn t.hp computer time required for the KL-2D bPCM, It was concluded that the cluster 
coding algorithm is about one order of magnitude more complex. 

Table 2. Computational Complexity of the Three 
Recoirmended Techniques 



Computational Requirements 
(Number of Operations per Pixel) 

Storage Requirements 

Compression Technique 

Adds (or subtracts)/ 
Pixel 

Multiplies/ 

Pixel 

Record 

Storage 

Memory 

3-Dimensional Hadamard 

Log 2 B 2 S ♦ T 

1 

16BSW 

! 

44B 2 S + 8000 ! 

Transform and Block 
Quantization 

(Typically 11) 

(Typically 1 ) 

(Typically 
3,300,000 bits) 

(Typically 

54.000 bits) j 

KL-7D DPCM 

S + T + 3 

S + 4 

16SW 

16S 2 + 148S + P000 


(Typically 8) 

(Typically 8) 

(Typically 
200.000 bits) 

(Typically 

9000 bits) i 

.. i 

KL-Hadamard DPCM 

Log 2 B + S + T + 1 

S + 2 

32SW 

i 

84BS + 32B + 32$(S + 1 ) *8000 


(Typically 8) 

(Typically 6) 

(Typically 
400,000 bits) 

(Typically 15,000 bits) 


PARAMETER 


TYPICAL VALUES 


B * Block size 16 

T 3 Bits/ptxel 1 

S a Spectral Bands 4 

W - Sample in each scan 3200 


5.4.4 Comparison of Channel Noise effects 

Both signal -to-noise ratio and classification accuracy are degraded by intoduction 

of a noisy transmission channel between the compressor and reconstructor. The results 

shown in Figure 11 refer to a binary symmetric channel with bit error rates ranging from 
-2 -4 

10 to 10 and a compressed data rate of 1 bit per pixel. Both algorithms degrade 

3 

significantly at higher bit error rates (> 10" ) with the Haar-20 DPCM somewhat worse. 

-3 -4 

Figure 12 shows the reconstructed infrared band (MSS-6) at BER's = 10 and 10 after 

compression to 1 bit per pixel by the Haar-2D DPCM. At this rate, only a few errors 
-4 

are expected at 10 but propagation of channel error effects is clearly visible at 
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Figure 11. Si gnal -to-’ioi se Ratio and Classification 
Consistency vs Bit Error Rate 
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10' 3 . This Is still less significant than if simple two-dimensional DPCM were used 
because, with the Haar, an error occurring in one transformed band is redistributed 
amongst all spectral bands upon inverse transforming at the reconstructo* - . Although 
this leaves the total error unchanged, it distributes it amongst all spectral bands and 
makes both individual bands and the composite less objectionable. 

5.4.5 Subjective Comparison of Reconstructed Imagery 

Much of the user analysis of multlspectral imagery is still performed using color 
composites. For this reason, a large number of techniques were compared at different 
data rates using reconstructed hard-copy images. Figures 13 through 15 show the individual 
reconstructed bands of KL-2D DPCM at 2 bits per pi. -el (1.5 bits per pixel with variable 
length encoding), the Haar-Cosine-DPCM at 1 bit cer pixel, and cluster coding at 1 oit 
per pixel. The following observations are made here and expanded upon in the Final 
Report : 

• Comparison with the originals shows very little degradation at 2 bits 
per pixel 

• Subjectively, the individual bands from the cluster coding method are 
inferior to the Haar-Cosine-DPCfl 

• The subjective quality of the color composites for cluster coding 
is superior to the Haar-Cosine DPCM mainly because cluster coding 
does not inherently alter spatial frequencies. 
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6. RESULTS AND CONCLUSIONS 


The major study goal of providing tradeoff data for future NASA program planning 
was accomplished. Detailed results are provided In the Final Report with numerous 
charts and tables. An outstanding conclusion of the study Is chat compression ratios 
of 3-4 to 1 are achievable with very little distortion on multi spectral earth resources 
imagery data. This is readily seen in Table 3 which shows negligible distortion for 
almost all conceivable applications at 2 bits per pixel data rates. The table also 
shows that by choosing the proper compression technique, a graaual degradation in com- 
plexity can be achieved at lower rates but at an increasing cost in complexity. 


Table 3. Selection of Compression Techniques 


Comprassion 

Ratio 

Bit R?ta 

Racommandati 

Tachniqua 

Typical 
SNR (MSE) 

Typical 

Classification 

Consistancy 

Relativa 

• Computational 
Complaxity 

3.35 to t 

2 fiits/Pixtl 

Haar-2D DPCM 

35dB (5) 

90% 

1 

5 to 1 

0.10 L*iti/Pix«1 

Haar-Cosina DPCM 

32.5dB (9) 

86.5% 

2 

6.75 to 1 

! oit/riA?l 

Haar-Cosina DPCM 
with Huffman 

31.5dB (11) 

83% 

4 

11 to 1 

0.625 Bit/P.xal 

Clustar Coding 

29dB (20) 

71% 

10* 


"Estimated 


Other important conclusions which may be drawn from Table 3 are: 

1) Fixed rate encoders given excellent quality imagery at 3-4 to 1 com- 
pression ratios. This compression ratio may be increased to 5 to 1 
by making the system operate at a variable rate with a Huffman 
encoder. Subjectively, reconstructed images are almost indistin- 
guishable from the originals. 

2) At a fixed bit rate and a compression ratio of almost 7 to 1, the 
Haar-Cosine-DPCM system gives acceptable results. Cluster coding 
techniaues can be used to obtain a compression ratio of 11 to 1 wi*-h 
possibly acceptable performance. Compression ratios of 13 to 1 and 
higher correspond to noticeably degraded imagery which would prob- 
ably be acceptable to only a few users. 

3) The results obtained with LANDSAT imagery are corroborated by quite 
different 12-channel MSS aircraft gathered data. Thus the contusions 
are felt to be quite general. 

The cost and complexity of applying data compression will vary depending on the 
particular sensor and application. Nevertheless, the inescapable conclusion is that 
NASA should give serious consideration to data compression in future programs. 
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7. RECOMMENDATIONS 


The fundamental conclusions of the study lead to the strong recommendation that 
NASA give serious consideration to on-board data compression for future earth resources 
missions. There is a high probability that its use will result in cost savings for 
major programs. Specific suggestions for future NASA activities in the data compres- 
sion area are as follows: 

1) A prototype of a data compression unit is recommended to both dem- 
onstrate the concept ana provide a body of test data for evaluation 
by data users. A specific suggestion is to build a Haar-2D D Dr M 
unit t.o be integrated with an operational multispectral scanner 
such as the thematic mapper now under development by NASA. Air- 
craft flight testing could then be performed to demonstrate a 4 or 
6 to 1 compression of data either on tape or on a data link and 
compare the uncompressed data for user acceptability. 

2) Perform system application studies to demonstrate cost savings by 
using representative earth resources projects such as LANDSAT or 
Eartn Observatory Satellite (EOS). The benefits of data compres- 
sion for EOS were stated in Section 1. Using the Haar-2D DPCM tech- 
nique at a compression ratio of 6 to 1 will reduce the bit rate for 
this system from 120 Mbps to 20 Mbps. Extrapolating the LANDSAT 
results to the six-band thematic mapper, the bandwidth compression 
system gives a signal -to-noise ratio of about 32 dB with a class- 
ification consistency of over 35^. 

3) Perform user acceptance studies. ’’his is of major importance in 
the evaluation of bandwidth compression methods since at present 
it is not known just which reconstructed images are acceptable to 
which users. This can be performed by selecting current users of 
multispectral data and verifying th e effects of compressed data on 
their standard processing. 

4) Study the use of adaptive compression techniques for greater com- 
pression ration. 

5) Further study of the cluster coding method is recommended to deter- 
mine its implementational feasibility and to optimize its performance. 
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